Messenger RNA profiling: body fluid identification using multiplex reverse transcription-polymerase chain reaction (RT-PCR)

ABSTRACT

This invention relates to a body fluids identification method and kit. A parallel, multiplex reverse transcription-polymerase chain reaction (RT-PCR) assay for the definitive identification of body fluids commonly encountered in forensic casework analysis, namely blood, saliva, semen, and vaginal secretions. The methodology is based on gene expression profiling analysis in which the body fluid-specific genes are identified by detecting the presence of appropriate messenger RNA species. Gene-specific primers are labeled with fluorescent dyes, separated and subjected to laser induced fluorescence for identification of body fluid-specific genes present in a sample stain.

This invention claims the benefit of priority from U.S. ProvisionalApplication Ser. No. 60/545,792 filed Feb. 19, 2004.

FIELD OF THE INVENTION

This invention relates to a ribonucleic acid (RNA) based assay systemand kit for body fluid identification, and in particular to a novel,multiplex, parallel assay system based on messenger RNA expressed inhuman tissue, and to a method for using the same.

BACKGROUND AND PRIOR ART

Conventional methods of body fluid identification use a variety oflabor-intensive, technologically diverse techniques that are performedin a series, not parallel, manner and are costly in terms of time andconsumption of sample. It used to be standard practice to performbiochemical, serological, and immunological tests to identify the bodyfluid(s) comprising a biological stain. Increasingly, however, classicalmethods for body fluid identification have no confirmatory technique forsome frequently encountered body fluids. For example, there is nodefinitive test for the presence of saliva or vaginal secretions, andurine identification can be problematic. The need exists for a morereliable, efficient assay system to supplant conventional methods forbody fluid identification.

Previous research in the development of a ribonucleic acid (RNA) basedassay system for the identification of body fluids, includedconsiderations for the use of protein and messenger RNA (mRNA) sinceboth are expressed in a tissue-type specific manner. However, multiplexanalysis of complex protein mixtures, such as those present in bodyfluid stains, requires further developments in the field of proteomics.Whereas, with messenger RNA (mRNA), the molecular intermediate betweengenetic DNA and expressed protein is at present, supported bytechnologies for massively parallel analysis in the field of genomics.

As reported by B. Alberts, et al. Molecular Biology of the Cell 3^(rd)ed., Garland Publishing Inc., NY, 1994, a pattern of gene expression isproduced that is unique to each cell type and is evinced by the presenceas well as the relative abundance of specific mRNAs. Each cell type,such as, blood monocytes, lymphocytes, ejaculated spermatozoa,epithelial cells lining the oral cavity or epidermal cells, has a uniquepattern of gene expression.

Specific gene expression patterns for saliva were reported by J. Juusolaand J. Ballantyne in February 2002 in a presentation to the AmericanAcademy of Forensic Science (AAFS) entitled, “The Development of an RNABased Assay System for Body Fluid Identification,” and in ForensicScience International, Vol. 135 (2003) pages 85-96 (“Messenger RNAProfiling: a Prototype Method to Supplant Conventional Methods for BodyFluid Identification”). Semen specific genes were reported by J. Juusolaand J. Ballantyne in “The Development of an RNA Based Assay System forBody Fluid Identification,” presented to AAFS, February 2002. M. Bauerand D. Patzelt also identified semen specific genes in an article,“Protamine mRNA as Molecular Marker for Spermatozoa in Semen Stains,”International Journal of Legal Medicine Vol. 117 (2003) pages 175-179.

As more and more tissue-specific genes (mRNAs) are identified for use inthe positive identification of body fluids and tissues of forensicimportance, there is an increasing need for a device or assay systemthat provides simultaneous and semi-automatic analysis through a commonassay format. The novel multiplex, parallel assay system of the presentinvention provides a common assay format and offers many advantages overthe conventional analysis procedures for body fluid identification.

SUMMARY OF THE INVENTION

A primary objective of the present invention is to provide facileidentification of the tissue components present in a body fluid stain.

A second objective of the present invention is to supplant the batteryof serological and biochemical tests currently employed in the forensicserology laboratory.

A third objective of the present invention is to provide a common assayformat that provides greater specificity in the identification of bodyfluids with improved timeliness.

A fourth objective of the present invention is to decrease sampleconsumption during analysis of stains containing body fluids.

A fifth objective of the present invention is to provide simultaneousand semi-automatic analysis through a common assay format.

A sixth objective of the present invention is to provide a multiplexanalysis of body fluids in an assay format that is compatible with DNAextraction methodologies.

A seventh objective of the present invention is to provide a kit thatcan be used to identify the source of at least four body fluids from ahuman being found in a single stain or a mixed stain.

A preferred method for preparing a multiplex, parallel assay to identifythe source of at least four body fluids from a human being includesobtaining a sample stain consisting of a body fluid from a human being,extracting total ribonucleic acid (RNA) from the sample stain, treatingthe total RNA with an enzyme, initiating a reverse-transcriptase (RT)reaction by treating total RNA with random decamers to produce cDNA,amplifying the cDNA using gene-specific primers controlling the size ofthe amplified cDNA to allow separation by electrophoresis, andidentifying body fluid-specific genes present in the sample as either asingle or a mixed stain.

The total RNA is extracted with a denaturing solution, such as,guanidine isothiocyanate-phenol:chloroform. Then, the extracted totalRNA is precipitated with an organic solvent, such as, isopropanol. Apreferred enzyme, such as, deoxyribonuclease I (DNase I) is used totreat the extracted total RNA.

The preferred multiplex assay is an octaplex composed of eight bodyfluid-specific genes optimized for the detection of body fluids fromfour specific body tissues. The fluids selected from four specific bodytissues, are identified as, blood, saliva, semen and vaginal secretions.

The gene-specific primers are labeled with fluorescent dyes and areincorporated into a single multiplexed polymerase chain reaction (PCR)and the cDNA is amplified prior to separation according to size. Apreferred method of separation is via capillary electrophoresis. Theseparated PCR product is then subjected to laser induced fluorescencefor identification of body fluid-specific genes present in a samplestain.

Eight body fluid-specific genes are used in a multiplex, parallel assayand the eight body fluid-specific genes are selected from the groupconsisting of: beta-spectrin (SPTB), porphobilinogen deaminase (PBGD),statherin (STATH), histatin 3 (H-1-N3), protamine 1 (PRM1), protamine 2(PRM2), mucin 4 (MUC4) and human beta-defensin 1 (HBD1)).

The messenger RNA genes for human blood are identified as beta-spectrin(SPTB) and porphobilinogen deaminase (PBGD). The messenger RNA genes forvaginal secretions are identified as mucin 4 (MUC4) and humanbeta-defensin 1 (HBD1).

A kit is provided for use in preparing a multiplex, parallel assay toidentify the source of at least four body fluids from a human being,comprising PCR primers pre-selected from 4 or more of the groupconsisting of 8 body fluid-specific gene primers constructed from SPTB,PBGD, STATH, HTN3, PRM1, PRM2, MUC4, and HBD-1 genes. In the kits, PCRprimers are tagged with fluorescent dyes. It is possible to redesignprimers for one or more of the body fluid-specific genes and it isunderstood that the redesigned primers are within the scope of thepresent invention.

Further objects and advantages of this invention will be apparent fromthe following detailed description and example of a presently preferredembodiment.

BRIEF DESCRIPTION OF THE DRAWING

FIG. 1 shows the sequences of the PCR primers for the various body fluidspecific genes used in the assay.

DESCRIPTION OF THE PREFERRED EMBODIMENTS

Before explaining the disclosed embodiments of the present invention indetail it is to be understood that the invention is not limited in itsapplication to the details of the particular arrangements shown sincethe invention is capable of other embodiments. Also, the terminologyused herein is for the purpose of description and not of limitation.

The invention provides a multiplex, parallel assay to identify thetissue source for at least four body fluids from a human being. Thefirst step in the development of the invention is the identification oftissue-specific genes that are expressed in only one tissue. Forexample, each tissue type is comprised of cells that have a uniquetranscriptome, or gene expression profile, also known as the messengerribonucleic acid (mRNA). The collection of genes that are expressedwithin the constellation of differentiated cells that make up a bodyfluid is called the multicellular transcriptome. These genes compriseubiquitously expressed housekeeping genes, which are responsible forcell maintenance functions, and genes that are specifically expressed incertain tissues only.

The mRNA molecules are present in different quantities depending on theparticular species of mRNA and the cell type, and can be classified asabundant, moderately abundant and rare. Although the present inventionis dependent on the identification of tissue-specific mRNA molecules, itis not to be considered a limitation of the invention. As previouslydiscussed, tissue-specific genes are still in the process of beingidentified.

In addition to identifying two body fluid-specific genes for each offour body fluids, the present invention provides a body fluididentification system or method. A number of design factors and assayquality controls ensure that the tissue-specific, RT-PCR assaysdisclosed herein detect mRNA and not genomic DNA.

The two body fluid-specific genes for each of four body fluids that arethe subject of the present invention, are identified in Table I below.

TABLE I BODY FLUID- SPECIFIC GENE: BODY FLUIDS beta-spectrin BLOOD(SPTB) porphobilinogen BLOOD deaminase (PBGD) statherin SALIVA (STATH)histatin 3 SALIVA (HTN3) protamine 1 SEMEN (PRM1) protamine 2 SEMEN(PRM2) mucin 4 (MUC4) VAGINAL SECRETIONS human beta- VAGINAL defensin 1SECRETIONS (HBD1)

Fluorescent dye-labeled primers for the eight genes identified above,are incorporated into a single multiplexed polymerase chain reaction(PCR). Following amplification, the PCR product is separated accordingto size with capillary electrophoresis and detected using laser inducedfluorescence. The resulting profile, or collection of specific peaks onthe electropherogram that represent the body fluid-specific genes,identify the body fluids that are present in the sample as a single ormixed stain. For this invention, the body fluids include, but are notlimited to, blood, saliva, semen or vaginal secretions.

The following example provides further explanation of the presentinvention.

EXAMPLE

A physiological stain is prepared by collecting human blood viavenipuncture; collecting 50 microliter (μl ) aliquots placed onto cottongauze and dried at room temperature. Freshly ejaculated semen iscollected in plastic cups, and allowed to dry onto cotton swabs at roomtemperature. Buccal scrapings and saliva samples were obtained on cottonswabs, and dried at room temperature. Vaginal secretions, such asmenstrual blood, are also obtained and dried at room temperature. ForRNA or DNA isolation a 50 microliter (μl) blood or saliva stain, or asingle semen, vaginal secretion, or menses blood stained cotton swab isused.

For RNA isolation, total RNA is extracted from blood, saliva, vaginaland semen stains with a denaturing solution, such as, guanidineisothiocyanate-phenol:chloroform and precipitated with isopropanol.Next, the extracted total RNA is treated with an enzyme,deoxyribonuclease I (DNase I), and then reverse-transcribed using randomdecamers as the first strand primer, producing complementary DNA (cDNA).Finally, the cDNA is amplified using gene-specific primers. The RT-PCRamplimer sizes are carefully chosen to span the range of approximately100 base pairs to approximately 300 base pairs, wherein base pairs isthe size of amplified product, to allow facile separation by standardelectrophoresis on agarose gels or by capillary electrophoresis.

The primers are custom synthesized commercially. Primers for PBGD, PRM1,and PRM2 are described by A. N. Gubin and J. L. Miller, in “Humanerythroid porphobilinogen deaminase exists in 2 splice variants.” Blood,97 (2001) pages 815-817, and K. Steger, K. Pauls, T. Klonisch, F. E.Franke and. M. Bergmann, “Expression of Protamine-1 and -2 mRNA duringhuman spermiogenesis” Mol. Hum. Reprod. 6 (2000) 219-225. The primersequences (and expected product sizes) are shown in FIG. 1. The assay ofthe present invention, includes, but is not limited to, the primersequences listed herein. A person skilled in the, art will readilyrecognize that it is possible to redesign primers for one or more of thegenes and it is understood that the redesigned primers are within thescope of the present invention.

The multiplex PCR reaction mixture contains buffer (10 mM Tris-HCl, pH8.3, 50 mM KCl), 3.5 mM MgCl₂, 0.125 mM each dNTP, 5.48 μM PCR primers(see above), and 1.25 units of DNA Polymerase. The PCR primer pairs areadded in the following concentrations: SPTB, 0.8 μM; PBGD, 0.24 μM,STATH, 0.4 μM; HTN3. 0.4 μM; PRM1, 0.04 μM; PRM2, 0.4 μM; HBD-1, 1.6 μM;MUC4, 1.6 μM. The standard PCR conditions used for the multiplexconsists of a denaturing step (95° C., 11 min) followed by 35 cycles(94° C., 20 sec; 55° C., 30 sec; 72° C., 40 sec) and a final extensionstep (72° C., 5 min) using a thermocycler instrument.

Based on the above extraction and separation techniques, a multiplexRT-PCR assay is developed for the definitive identification of all ofthe body fluids commonly encountered in forensic casework analysis,namely blood, saliva, semen and vaginal secretions. The octaplex iscomposed of eight body fluid-specific genes and has been optimized forthe detection of blood, saliva, semen, and vaginal secretions as singleor mixed stains. The methodology is based on gene expression profilinganalysis in which the body fluid-specific genes are identified bydetecting the presence of appropriate mRNA species. The gene-specificprimers are labeled with fluorescent dyes (such as Tet or Hex or Fam asindicated in FIG. 1) incorporated into a single multiplexed polymerasechain reaction (PCR). The cDNA is amplified using PCR, and the PCRproduct is separated according to size using a capillary electrohoresisinstrument, and detected using laser induced fluorescence. The resultingprofile identifies those body fluids present in a single or mixed stain.

Advantages of the mRNA-based approach, compared to conventionalbiochemical analysis include, but are not limited to, greaterspecificity, simultaneous and semi-automated analysis through a commonassay format, improved timeliness, decreased sample consumption andcompatibility with DNA extraction methodologies. While the invention hasbeen described, disclosed, illustrated and shown in various terms ofcertain embodiments or modifications which it has presumed in practice,the scope of the invention is not intended to be, nor should it bedeemed to be, limited thereby and such other modifications orembodiments as may be suggested by the teachings herein are particularlyreserved especially as they fall within the breadth and scope of theclaims here appended.

1. A kit for use in performing a multiplex, parallel PCR assay toidentify the source of body fluids from a human being, selected from thegroup consisting of one or more of blood, saliva, semen and vaginalsecretions, consisting of PCR primer pairs selected from the groupconsisting of 5′-agg-atg-gct-tgg-cct-tta-at (Seq ID 1),5′-act-gcc-agc-acc-ttc-atc-tt (Seq ID 2),5′-tgg-atc-cct-gag-gag-ggc-aga-ag (Seq ID 3),5′-tct-tgt-ccc-ctg-tgg-tgg-aca-tag-caa-t (Seq ID 4),5′-ctt-ctg-tag-tct-cat-ctt-g (Seq ID 5),5′-tgg-ttg-tgg-gta-tag-tgg-ttg-ttc (Seq ID 6),5′-gca-aag-aga-cat-cat-ggg-ta (Seq ID 7), 5′-gcc-agt-caa-acc-tcc-ata-atc(Seq 133 8), 5′-gcc-agg-tac-aga-tgc-tgt-cgc-ag (Seq ID 9),5′-tta-gtg-tct-tct-aca-tct-cgg-tct (Seq ID 10),5′-gtg-agg-agc-ctg-agc-gaa-cgc (Seq ID 11),5′-tta-gtg-cct-tct-gca-tgt-tct-ctt-c (Seq ID 12),5′-tcc-ctg-aaa-tcc-tga-gtg-tt (Seq ID 13), 5′-taa-cag-gtg-cct-tga-att-tt(Seq ID 14), 5′-gga-cca-cat-ttt-atc-agg-aa (Seq ID 15),5′-tcg-aga-aac-agg-gca-tag-ga (Seq ID 16).
 2. A kit, as in claim 1,wherein said primers are tagged with fluorescent dyes.
 3. A kit, as inclaim 1, wherein the primers for identifying the source of body fluidsconsist of one or more of the group consisting of (SPTB primers)5′-agg-atg-gct-tgg-cct-tta-at (Seq ID 1), 5′-act-gcc-agc-acc-ttc-atc-tt(Seq ID 2), (STATH primers) 5′-ctt-ctg-tag-tct-cat-ctt-g (Seq ID 5),5′-tgg-ttg-tgg-gta-tag-tgg-ttg-ttc (Seq ID 6), (HTN3 primers)5′-gca-aag-aga-cat-cat-ggg-ta (Seq ID 7), 5′-gcc-agt-caa-aac-tcc-ata-atc(Seq ID 8), (HBD-1 primers) 5′-ttc-ctg-aaa-tcc-tga-gtg-tt (Seq ID 13),5′-taa-cag-gtg-cct-tga-att-tt (Seq ID 14), (MUC4 primers)5′-gga-cca-cat-ttt-atc-agg-aa (Seq ID 15), 5′-tag-aga-aac-agg-gca-tag-ga(Seq ID 16).
 4. A kit, as in claim 3, wherein SPTB primer5′-agg-atg-gct-tgg-cct-tta-at (Seq ID 1) is labeled with 6-FAM, PBGDprimer 5′-tgg-atc-cct-gag-gag-ggc-aga-ag (Seq ID 3) is labeled with TET,STATH primer 5′-ctt-ctg-tag-tct-cat-ctt-g (Seq ID 5) is labeled with6-FAM, HTN3 primer 5′-gca-aag-aga-cat-cat-ggg-ta (Seq ID 7) is labeledwith 6-FAM, PRM1 primer 5′-gcc-agg-tac-aga-tgc-tgt-cgc-ag (Seq ID 9) islabeled with HEX, PRM2 primer 5′-gtg-agg-agc-ctg-agc-gaa-cgc (Seq ID 11)is labeled with 6-FAM, HBD-1 primer 5′-ttc-ctg-aaa-tcc-tga-gtg-tt (SeqID 13) is labeled with 6-FAM, and MUC4 primer5′-gga-cca-cat-ttt-atc-agg-aa (Seq ID 15) is labeled with HEX.
 5. A kit,as in claim 3, wherein said PCR primers are tagged with fluorescentdyes.
 6. A kit, as in claim 1, wherein said kit further comprises anenzyme to facilitate a PCR reaction.
 7. A kit, as in claim 6, whereinsaid enzymes is DNA Polymerase.
 8. A kit, as in claim 6, furthercomprising buffers designed to facilitate said multiple parallel assay.9. A kit, as in claim 8, wherein said buffer comprises 10 mM Tris-HCl,pH 8.3, 50 mMKCl, and 1.5 mM MgCl2.
 10. Primers for identifying thesource of body fluids consisting of one or more of the group consistingof, (STATH primers) 5′-ctt-ctg-tag-tct-cat-ctt-g (Seq ID 5),5′-tgg-ttg-tgg-gta-tag-tgg-ttg-ttc (Seq ID 6), (HTN3 primers)5′-gca-aag-aga-cat-cat-ggg-ta (Seq ID 7), 5′-gcc-agt-caa-acc-tcc-ata-atc(Seq ID 8), (HBD-1 primers) 5′-ttc-ctg-aaa-tcc-tga-gtg-tt (Seq ID 13),5′-taa-cag-gtg-cct-tga-att-tt (Seq ID 14), (MUC4 primers)5′-gga-cca-cat-ttt-atc-agg-aa (Seq ID 15), 5′-tag-aga-aac-agg-gca-tag-ga(Seq ID 16).
 11. The Primers for identifying the source of body fluidsas in claim 10, wherein said primers are tagged with fluorescent dyes.12. Primers for identifying the source of body fluids as wherein PBGDprimer 5′-tgg-atc-cct-gag-gag-ggc-aga-ag (Seq ID 3) is labeled with TET,STATH primer 5′-ctt-ctg-tag-tct-cat-ctt-g (Seq ID 5) is labeled with6-FAM, HTN3 primer 5′-gca-aag-aga-cat-cat-ggg-ta (Seq ID 7) is labeledwith 6-FAM, PRM1 primer 5′-gcc-agg-tac-aga-tgc-tgt-cgc-ag (Seq ID 9) islabeled with HEX, PRM2 primer 5′-gtg-agg-agc-ctg-agc-gaa-cgc (Seq ID 11)is labeled with 6-FAM, HBD-1 primer 5′-ttc-ctg-aaa-tcc-tga-gtg-tt (SeqID 13) is labeled with 6-FAM, and MUC4 primer5′-gga-cca-cat-ttt-atc-agg-aa (Seq ID 15) is labeled with HEX.